Using speech rhythm for acoustic language identification

Authors

  • Ekaterina Timoshenko
  • Harald Höge
Abstract

This paper presents results on using rhythm for automatic language identification (LID). The idea is to explore the duration of pseudo-syllables as a language-discriminative feature. The resulting Rhythm system is based on bigram duration models of neighbouring pseudo-syllables. The Rhythm system is fused with a Spectral system realized by a parallel phoneme recognition (PPR) approach using MFCCs. The LID systems were evaluated on a 7-language identification task using the SpeechDat II databases. Tests were performed with 7-second utterances. Whereas the Spectral system, acting as the baseline, achieved an error rate of 7.9%, the fused system reduced the error rate by 10% relative.
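
To make the idea of a bigram duration model over neighbouring pseudo-syllables concrete, here is a minimal sketch in Python. It is not the authors' implementation: the abstract does not say how durations are quantized, smoothed, or fused with the Spectral system, so the bin edges (`BIN_EDGES`), the add-one smoothing, and the function names `quantize`, `train_bigram`, `log_likelihood`, and `identify` are all assumptions made for illustration. Pseudo-syllable segmentation and the fusion step are left out.

```python
import math
from collections import Counter

# Hypothetical bin edges in seconds (an assumption; the abstract gives no quantization).
BIN_EDGES = [0.10, 0.20, 0.30]


def quantize(duration):
    """Map a pseudo-syllable duration in seconds to a discrete bin index."""
    for i, edge in enumerate(BIN_EDGES):
        if duration < edge:
            return i
    return len(BIN_EDGES)


def train_bigram(sequences):
    """Count bigrams of neighbouring duration bins over training utterances."""
    bigrams, unigrams = Counter(), Counter()
    for durations in sequences:
        bins = [quantize(d) for d in durations]
        for prev, cur in zip(bins, bins[1:]):
            bigrams[(prev, cur)] += 1
            unigrams[prev] += 1
    return bigrams, unigrams


def log_likelihood(model, durations):
    """Score one utterance with add-one smoothed bigram probabilities (assumed smoothing)."""
    bigrams, unigrams = model
    n_bins = len(BIN_EDGES) + 1
    bins = [quantize(d) for d in durations]
    score = 0.0
    for prev, cur in zip(bins, bins[1:]):
        p = (bigrams[(prev, cur)] + 1) / (unigrams[prev] + n_bins)
        score += math.log(p)
    return score


def identify(models, durations):
    """Return the language whose bigram duration model scores the utterance highest."""
    return max(models, key=lambda lang: log_likelihood(models[lang], durations))
```

In use, one model per language would be trained on lists of pseudo-syllable durations, and `identify` would pick the best-scoring language for a test utterance. As a side note on the reported figures, a 10% relative reduction from a 7.9% error rate corresponds to roughly 7.9 × 0.9 ≈ 7.1% absolute.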

Similar resources

Rhythmic unit extraction and modelling for automatic language identification

This paper deals with an approach to automatic language identification based on rhythmic modelling. Besides phonetics and phonotactics, rhythm is actually one of the most promising features to be considered for language identification, even if its extraction and modelling are not a straightforward issue. Actually, one of the main problems to address is what to model. In this paper, an algorithm ...

Deep Neural Network Bottleneck Features for Acoustic Event Recognition

Bottleneck features have been shown to be effective in improving the accuracy of speaker recognition, language identification and automatic speech recognition. However, few works have focused on bottleneck features for acoustic event recognition. This paper proposes a novel acoustic event recognition framework using bottleneck features derived from a Deep Neural Network (DNN). In addition to co...

Language identification with suprasegmental cues: a study based on speech resynthesis.

This paper proposes a new experimental paradigm to explore the discriminability of languages, a question which is crucial to the child born in a bilingual environment. This paradigm employs the speech resynthesis technique, enabling the experimenter to preserve or degrade acoustic cues such as phonotactics, syllabic rhythm, or intonation from natural utterances. English and Japanese sentences w...

The Relationship Between Acoustic Characteristics and Personality Dimensions in Patients With Dysphonia

Objectives: Voice is influenced by personality. However, it is still questionable which acoustic features are influenced by personality traits. This study aimed to investigate the relationship between acoustic characteristics and personality dimensions. Methods: Thirty-three participants with dysphonia and 33 participants without dysphonia were recruited to take part in this cross-sectional st...

Validating Acoustic Measures of Speech Rhythm for Second Language Acquisition

This paper reports research investigating the validity of using Pairwise Variability Indexes in research into the second language acquisition of speech rhythm. Findings determined that 1) expert native-speakers rate non-native speaker rhythm based on a common factor, and 2) part of that common factor can be accounted for by the use of vocalic pairwise variability. It was concluded that the PVI ...

Publication date: 2007